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ON THE CLASSIFICATION OF SYSTEMS OF LINEAR EQUATIONS. 



By ARTHUR RANUM, Cornell University. 



In a recent paper in The American Mathematical Monthly,* "On 
the Solution of a System of Linear Equations," Professor Miller has proved 
that in all the solutions of a system of consistent linear equations a particu- 
lar unknown xt will have the same value, that is, will satisfy an equation of 
the form bXi+l—0 (MO), if and only if the omission of the coefficients of 
Xi from the matrix of the System reduces the rank of the matrix. This the- 
orem immediately suggested to me a generalization in which several 
unknowns are involved instead of only one, and I soon found that the more 
general theorem so derived is a sufficient basis for, and naturally gives rise 
to, a complete classification of systems of consistent linear equations, 
not only with respect to the rank of their matrices, but also with respect to 
the rank of the partial matrices obtained by omitting the coefficients of one 
or more unknowns. In this note I wish to state and prove the theorem, 
explain the resulting classification, apply it to one or two simple cases, and 
point out its geometrical significance. 

Reduction in Rank. 

1. Consider any system T of m linear equations in n~s-\-t unknowns, 
as follows: 

( o 11 x 1 +...+o 1 i x s 4-ai, s +ix B +i-{-...+ai^Cn-\rk 1 —-0, 
T . . . . . . . • • • _■ 

\ t*7jtiy/i "T~. • • \ ClinsXs iCbm, s+1 ^s+1 ~T~ • • • "I QsmnXni fCm — v. 

Theorem. A necessary and sufficient condition that in all the solutions 
of a system of consistent linear equations the values of any particular set of 
unknowns x s +i, ..., x„, t in number, are connected by the same linear relation 
of the form 



•Vol. XVII (1910), p. 137. 
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(1) liXs+i + .-.+ltXa + h^O, 

in which the coefficients l lt ..., l t do not all vanish, is that the omission of the 
coefficients of these unknowns from the matrix of the system will reduce the 
rank of the matrix. 

2. Proof. That the condition is necessary is immediately evident, 
exactly as in the special case t=l, which Professor Miller considers. 
To show that the condition is also sufficient, we proceed by the method of 
induction under the assumption that the sufficiency of the condition holds 
for t—1 unknowns, we wish to show that it must also hold for t unknowns. 

Let us suppose, therefore, that the omission of the coefficients of 
ajg+i, ..., Xn from the matrix of the system causes a reduction in the rank of 
the matrix. If it happens that the omission of the coefficients of t—1 
of these unknowns would alone suffice to reduce the rank of the matrix, 
then by the assumption just made these t—1 unknowns would be connected 
by a linear relation and the sufficiency of the condition would be proved. 

Hence we may confine ourselves to the case in which a reduction of 
rank is not caused by the omission of the coefficients of x s +2, ..., x n , say, but 
only by the further omission of the coefficients of x s +i. Let r, i s+1, be the 
rank of the matrix of the system; then the partial matrix A, formed by the 
coefficients of x u ..., x s , x s +i is also of rank r, while the matrix A' formed 
by the coefficients of #i, ..., x s is of rank r— 1. Consequently there must 
exist in A x at least one non-vanishing r-rowed determinant l x whose last 
column involves coefficients of x s +\. There is no loss of generality in taking 
as the elements of this determinant the coefficients of x lt ..., x r -i, x s +i in the 
first r equations of the system. That is, 

(2) ^— 5 in,! ... a r —\ , r-i a r , s+i7^0. 

Let us now substitute for x s +i, x n in the equations of the system 

T any set of values that satisfy them; the resulting system T in the s un- 
knowns x u ..., x s will be consistent; and since the rank of its matrix A' is 
r— 1, the rank of its augmented matrix 



B'~ 



a xx ...a xs «i, stiX s +i+. ..+«!« Xn-^ky 

&mi***Clms Cvm, s-r-l^s-l-1 ~T~ • • • \~Cbnm Xn\r£w 



is also t— 1*; that is, every r-rowed determinant of B' must vanish. Con- 
sider the determinant a common to the first r rows and the first r— 1 col- 

*Cf. Capelli, Rivista di Matematica, Vol. 2 (1892), p. 54. English statements of Capelli's theorem are to be 
found in Bocher's Introduction to Higher Algebra, p. 46, Theorem 1, and in Miller's paper cited above. 
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umns together with the last column of B'. Since the elements of the last 
column of a are linear functions of x s +i, ..., x n , a itself can be expressed 
as a linear function of the same variables, and their coefficients are r-rowed 
determinants involving the elements of the augmented matrix of the origin- 
al system T. Thus 

a =l 1 x s +i+... +l t Xn+h=0; 

and l u being precisely the determinant of equation (2), does not vanish. 
We have now shown that all the values of x a +i, ..., x n that satisfy the equa- 
tions of the system T are connected by the same linear relation of 
the required form; therefore the theorem is proved, provided we can estab- 
lish it for the case t=l. 

But it is evident that the method we have used for passing from t— 1 
to t is also applicable to the case t -1, and so affords an independent proof 
of Professor Miller's theorem, besides completing the proof of this. 

3. A number of corollaries are immediately deducible. For instance, 
if the unknowns x s +i, ..., x n are unrestricted except by a single linear rela- 
tion, or in other words, if in the solutions of the system T the values of 
these unknowns are precisely those that satisfy one linear relation of the 
form (1), then the rank of the matrix will be reduced exactly one unit by 
the omission of the coefficients of these unknowns, and conversely. On the 
other hand, if there exist i (and only i, say) independent and compatible 
linear relations connecting the t unknowns x a +i, ..., x n , then the rank r of 
the matrix will be reduced exactly i units, and conversely; evidently i ~ t 
and i^r; if i—t, each of the t unknowns is restricted to a single value and 
the rank is reduced one unit by the omission of the coefficients of each of 
the t unknowns. 

Moreover, if the relation (1) actually involves all of the t unknowns 
Xs+i, ..., x n , that is, if k^O (i—1, ..., t), and if there is no other linear rela- 
tion between them, then the omission of the coefficients of all of these 
unknowns, and not merely of some of them, will reduce the rank of 
the matrix, and the amount of the reduction is one unit. The converse again 
holds. 

Similarly, if the t unknowns are connected by % independent and com- 
patible linear relations that actually involve all the unknowns, then the re- 
duction in rank due to the omission of the coefficients of all of these t un- 
knowns is equal to i, while the reduction due to the omission of the coeffi- 
cients of a part of the unknowns is less than i, and conversely. 

Classification. 

4. Let us now consider the bearing of this theorem and its corollaries 
on the classification of systems of linear equations. Let T be a consistent 
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system of rank r in n unknowns. After a suitable permutation of 
the unknowns among themselves, the complete solution of T may be written 
in the form 

s ] ^ . . . . . 

which means that all the solutions of T are obtainable by giving all possible 
values to x r +i, ..., x n and for each set of values of these unknowns finding 
the corresponding values of the other unknowns x u ..., x r from the equa- 
tions S. 

Each of the equations S is of the form of equation (1), since the coef- 
ficients of x lt ..., x r cannot vanish. But the remaining coefficients Uj may 
some or all of them vanish, and if we classify the systems T with respect to 
the vanishing or non-vanishing of these coefficients, that is, by determining 
which of the unknowns av+i, ..., x n are actually involved in the various 
equations of S, the different classes so obtained (see § 5) will be 
distinguished from one another and characterized by the particular sets of 
unknowns, the omission of whose coefficients from the matrix of T 
will cause reductions in the rank of the matrix. 

Since no two of the equations S involve the same unknowns, those on 
the left being different in every case, therefore the reduction in rank caused 
by the omission of the coefficients of (a) all, or (b) a part of the unknowns 
actually involved in any one of the equations of S is equal to (a) one unit or 
(b) zero units. 

We define the symbol R Xa ... xx as the number of units of reduction in 
rank that is caused by the omission of the coefficients of x a , ..., X\ from the 
matrix of T; and we understand in every case by an equation of the form 
Rx a ... x X = 3, that the omission of the coefficients of a part of these unknowns 
will reduce the rank less than j units, except when i=0. As an illustration, 
let T be a system of rank 3 in five unknowns whose complete solution S is 
of the form 

Jj i h \ 1X4 I t \ 2 J(/ 5 [~ Yb \ 

x s = h 3 

where 1 1U l lt , l 2l are all different from zero; then the class of systems to 
which T belongs will be characterized by the equations R Xl Xi x,~Rx.. » 4 = 
R x , = 1 ; moreover, 

Kxi x-t -fVx 2 -fVaj 4 ", ttxiXvXiXc ttx* x 3 Xi ^, anu HxiXnXzX^x^^O, 
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5. Now the number of ways in which the r(n—r) coefficients kj in the 
equations S can be chosen with respect to their vanishing or not is 2 r(TC_r) . 
But these do not all give essentially distinct classes of systems T. For if 
two such choices differ only in the order in which the unknowns are 
arranged, that is, if one can be obtained from the other by permuting either 
the n— r unknowns av+i, ..., *» on the right or the r unknowns x,, ..., x r , on 
the left, then they give rise to essentially the same class; otherwise to dis- 
tinct classes. 

Let N n , r denote the number of classes of systems of consistent linear 
equations of rank r in n unknowns. Then N n . r is equal to the total number 
of combinations of r(n—r) things arranged in the form of a rectangle with 
r rows and n—r columns, provided that two combinations be regarded as 
identical, if one can be obtained from the other by permuting the rows of 
this rectangle or the columns or both. Obviously, Nn, r=N n , n -r. 

Query. Is there a convenient formula, old or new, giving the value 
of Nn, r in terms of n and r? 

For a few of the smaller values of n and r it is not difficult to find the 
corresponding values of N n , r , as follows: 
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Geometrical Interpretation. 

6. If we interpret x lt ..., x» as the Cartesian coordinates of a point in a 
space Sn of n dimensions, * then a system T of consistent linear equations of 
rank r in these variables will stand for a system of linear (flat) spaces S n ~i 
intersecting in a linear space S n -r', and the different classes we have been 
considering will correspond to the different degrees of parallelismf that can 
exist between S n - r and the coordinate axes, coordinate planes, etc. 

On the other hand, if the equations of T are homogeneous in the n 
variables, in which case they are always consistent, then another geometric- 
al interpretation is possible. Namely, let the variables be the homogeneous 
coordinates of a point in a space S n -i, referred to a coordinate simplex. J T 
will now stand for a system of linear spaces £,1-2 interacting in a linear space 

*Cf. Schoute, Mekrdimensionale Geometrie, Vol. 1, pp. 29, 125, and 137. 
fSchoute, 1. c, p. 34. 
$Sehoute, 1. c, p. 142. 



160 

S n -r-i, except when r=n; in the latter case the spaces have no point 
in common. The classification we have been considering will correspond to 
the different degrees of intersection, including coincidence and non-intersec- 
tion, that exist between S n - r-i and the vertices, edges, faces, etc., of the 
coordinate simplex. 



Illustrations. 

7. In order to illustrate the general theory, a few of the simpler spe- 
cial cases will now be mentioned. 

First, we put n=- 3 and enumerate the different classes of consistent 
systems T of linear non-homogeneous equations in the three variables x,'y, z 
for each possible value of the rank r. Geometrically, T is a system 
of planes having at least one point in common, and referred to Cartesian 
axes, retangular or oblique, in ordinary space of three dimensions. 
Let Sa-r be the locus common to these planes. 

r=3. Every system T has one solution. The number of classes of 
systems is N 3 , s =1. This one class is characterized by the equations R X =R V 
—R z =l, or by the single equation R xvz =3. The common locus S of the 
planes is a finite point. 

t—2. T has oo 1 solutions. The number of classes is Na, 2 =3. Call 
these classes (2,), (2 2 ), and (2 3 ), respectively. 

(2,) Rxz=R 1/ z=l. Moreover, Rx V —l. 

The common locus Si is a (finite) line not parallel* to a coordinate 
plane. 

(2 2 ) Rx— Ryt—1. St is a line parallel to the plane x—0, but not par- 
allel to a coordinate axis. 

(2 S ) Rx=R v ~l or R xv =2. S t is a line parallel to the z-axis. 

r=l. T has 00 2 solutions. The number of classes is N s , 1 =3. 

(11) R xyz =l. Ss, is a plane not parallel to a coordinate axis. 

(1 2 ) R xv =l. S 2 is a plane parallel to the 2-axis, but not parallel to a 
coordinate plane. 

(1 3 ) Rx=\. S2 is a plane parallel to x=0. 

8. As a second illustration we take homogeneous equations and put 
n~ 4. Let x, y, z, w, be the homogeneous coordinates of a point in three- 
dimensional space referred to a coordinate tetrahedron. Tis again a system 
of planes and £>3--r is their common locus, except when r=4. 

r=A. One solution, x=y—z=w=0. Ni,i=l. 
i?.r=-B 2/ = J B 2 =jBw=l. No common locus. 
r=3. 00 x solutions. A^4, 3 =4. 

(3 , ) R X w =R yw —Rz m —l. S is a (finite or infinite) point not lying in 
a coordinate plane. 

*It is to be understood here that two parallel lines or planes may, in particular, be coincident lines or planes, 
and that a line parallel to a plane may be a line lying in the plane. 



161 

(3 8 ) R x —R V w=Rzw~-=l. S is a point lying in the plane x—Q, but not 
in a coordinate axis. 

(3 3 ) Rx^Ry^Rzw—1. S is a point in the line x y=0, but not coin- 
ciding with a coordinate vertex. 

(3 4 ) R x —R y —-R z =l. S is the point x=y=z=0. 
r=2. oo 2 -solutions. Nt, 2=7. 

(2, ) Rxm, : =Rviw=l. Si is a line not meeting a coordinate axis (either 
in a finite or an infinite point) . 

(2 4 ) R X w—R v ^w=l. S x is a line meeting a3=w=0, but not meeting any 
other coordinate axis. 

(2 3 ) R xw =R yz =l. Si is a line meeting the two opposite coordinate 
axes x =w=0 and y =z - 0, but not lying in a coordinate plane. 

(2 4 ) R x =Ryzw— 1. Si is a line lying in the plane x=0, but not pass- 
ing through a coordinate vertex. 

(2 5 ) Rxz-Ryz 1. »Si is a line passing through the point a:=2/=z=0, 
but not lying in a coordinate plane. 

(2 e ) Rx=R V z=l. <S\ is a line passing through the point x==y=s~-0, 
and lying in the plane x = 0, but not coinciding with a coordinate axis. 

(2 7 ) #* ^.Ry -1. S, is the line x =y —0. 

r=l. co 3 solutions. JV4, 1 =4. 

(1 , ) Rxyzw 1. S is a plane not passing through a coordinate vertex. 

(1 8 ) i2aa«=l. St is a plane passing through the point x=2/=z=0, but 
not containing a coordinate axis. 

(1 8 ) Rx y =l. St is a plane containing the line x=y=0, but not coin- 
ciding with a coordinate plane. 

(1 4 ) R x — 1. S 2 is the plane x=0. 

9. It may be worth while to add the four-dimensional interpretation 
of the seven classes (2,), ..., (2 7 ) just given, for which n=4 and r=2, 
under the supposition that the equations of T are again non-homogeneous. 

(2J S 2 is a plane having neither complete nor partial parallelism to 
any coordinate plane. 

(2 2 ) St is a plane half parallel to x=w=0, but neither completely 
parallel nor half parallel to any other coordinate plane. 

(2 3 ) S 2 is a plane half parallel to the two opposite coordinate planes 
x--=w=0 and y—z=0, but not (completely) parallel to a coordinate space S s . 

(2 4 ) St is a plane parallel to the space x—0, but not parallel to a co- 
ordinate axis. 

(2 5 ) S a is a plane parallel to the line x~y=z=0, but not parallel to a 
coordinate space. 

(2 e ) S 2 is a plane parallel both to the line x=y— z=0 and to the space 
x=0, but not completely parallel to a coordinate plane. 
(2 7 ) S 2 is a plane completely parallel to x=y=0. 



